README



This folder contains the replication materials for "Validating Wordscores: The Promises and Pitfalls of Computational Text Scaling" and contains 14 files, including this file, a pdf file with supplementary analyses as referenced in the published paper, and an xls file with additional information with regards to the coding of publications in the citation analysis.



Before starting replication, make sure that the main folder is unzipped.

Then, unzip the "LBG replication manifestos.zip" "Manifestos.zip", "Parsing.zip", and "Stata code.zip" in the main folder. Do not move the contents of these folders to the main folder, but leave them in their respective folders.



The file "Replication.R" contains the code necessary to replicate the figures and tables in the article and the appendices. This file uses the "benchmarks.csv", "benchmarks_parse.csv", "citation_data.csv" and "content_validity.csv" files, and calls upon the data in the "Manifestos" and "Parsing" folders.



To replicate the figures in Appendix B, the STATA code as found in "lbg_replication_code.txt" can be used. This code can be copied and pasted into the STATA editor. The manifestos used in this code are found in the folder "LBG replication manifestos". As the code calls upon two different versions of the wordscores package as implemented in STATA, the code for these two versions can be found in the "Stata code" folder. The "lbg_replication.dta" contains the results of the analysis. Follow the instructions in the "lbg_replication_code.txt" to carry out the analysis.




